Fine-grained Semantic Typing of Emerging Entities
نویسندگان
چکیده
Methods for information extraction (IE) and knowledge base (KB) construction have been intensively studied. However, a largely underexplored case is tapping into highly dynamic sources like news streams and social media, where new entities are continuously emerging. In this paper, we present a method for discovering and semantically typing newly emerging out-of-KB entities, thus improving the freshness and recall of ontology-based IE and improving the precision and semantic rigor of open IE. Our method is based on a probabilistic model that feeds weights into integer linear programs that leverage type signatures of relational phrases and type correlation or disjointness constraints. Our experimental evaluation, based on crowdsourced user studies, show our method performing significantly better than prior work.
منابع مشابه
No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities
Entity linking systems link noun-phrase mentions in text to their corresponding Wikipedia articles. However, NLP applications would gain from the ability to detect and type all entities mentioned in text, including the long tail of entities not prominent enough to have their own Wikipedia articles. In this paper we show that once the Wikipedia entities mentioned in a corpus of textual assertion...
متن کاملSANE: System for Fine Grained Named Entity Typing on Textual Data
Assignment of fine-grained types to named entities is gaining popularity as one of the major Information Extraction tasks due to its applications in several areas of Natural Language Processing. Existing systems use huge knowledge bases to improve the accuracy of the fine-grained types. We designed and developed SANE, a system that uses Wikipedia categories to fine grain the type of the named e...
متن کاملAutomatic Fine-Grained Semantic Classification for Domain Adaptation
Assigning arguments of verbs to different semantic classes (‘semantic typing’), or alternatively, checking the ‘selectional restrictions’ of predicates, is a fundamental component of many natural language processing tasks. However, a common experience has been that general purpose semantic classes, such as those encoded in resources like WordNet, or handcrafted subject-specific ontologies, are ...
متن کاملMulti-level Representations for Fine-Grained Typing of Knowledge Base Entities
Entities are essential elements of natural language. In this paper, we present methods for learning multi-level representations of entities on three complementary levels: character (character patterns in entity names extracted, e.g., by neural networks), word (embeddings of words in entity names) and entity (entity embeddings). We investigate state-of-theart learning methods on each level and f...
متن کاملRanking Entities in a Large Semantic Network
We present two knowledge-rich methods for ranking entities in a semantic network. Our approach relies on the DBpedia knowledge base for acquiring fine-grained information about entities and their semantic relations. Experiments on a benchmarking dataset show the viability of our approach.
متن کامل